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[57] ABSTRACT 

A file-based Read-ahead method employs asynchronous 
I/O processes to fetch Demand and Read-ahead data 
blocks from a disk (4), depending on their physical and 
logical sequentialities. When a file system (10) issues an 
I/O address to obtain data blocks from a file, it also 
determines the disk location of data blocks that are 
logically consecutive with the demaiided data. The 
number of such data blocks to be determined in this 
process is accomplished in accordance with a dynamic 
Read-ahead method. If the data blocks are physically 
consecutive, a single I/O request is sent to a disk driver 
to obtain both the Demand data block and to Prefetch 
the Read-Ahead data block(s). If, mstead, the data 
blocks are found not to be physically consecutive, a 
plurality of asynchronous disk I/O requests are issued 
so as to separately obtain the Demand and the Read- 
Ahead data blocks. 

11 Claims, 4 Drawing Sheets 
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for a significant period of time, thereby reducing the 
ASYNCHRONOUS KEAD-AHEAD DISK CACHING efSdeocy of the disk cache. 
USING MULUFLE DISK I/O PHOCE SSES AND It is thos one object of this invention to provide a 
DYNAMICALLY VARIABLE FREFETCH LENGTH method to improve the efSdency of a data cache, par- 
5 ticulariy a didc cache. 

FIELD OF THE INVENTION It is another object of this mvention to provide a disk 

This mvention relates generaUy to data processing having a prefetch mechanism that is optimized 

systems (hat employ a data cache wherein both demand effidentiy responding to sequential Demands for 

and prefetched data are stored and, in particular, to 

improved methods for prefetching units of data from a SUMMARY OF THE INVENTION 

mass storage device, such as a rotating disk. — . . , 

The foregomg and outer problems are overcome and 

BACKGROUND OF THE INVENTION the objects of the invention are realized by a file-based 

Disk cadimg is a known disk access avoidance tech- Read-ahead method that employs asynchronous I/O 
niquethatmaintainsasetofdataUocksinahighspeed " Pro<»s«i to fetch Demand and R^ad-ah^ 
disk cache memory, with the expectation that some or ^^^^ P^^''^ fi^ sequoitialmes. 

all of tliese data blocks win subsequently be requested system uames an I/O adcfress to obtam data 

(demanded) by a data processor. In additioTto de- ^oni a file, it dso detmnines locations m a mass 

manded data blocks that*Me cached during each disk „ ^vfvSrth. ^i^^ t T"^? "T^T 
access, a read-ahead disk cache determines a number of ^° ^vevaAflie demanded d^^^ 

additional blocks to be fetched into the disk cache, and ^'"^J^'^ determm.^ m this process is accomphdied 
thus eliminates the need of some future disk access to Sj^S^. I Read-^ ""^^^ 

the data. In that a disk access, also known as an Input- "t,^^ ff^ST^^ ^"^f^^X^^ 
/C>utputa/0)Proce^istypicaUyoneormoreo.5^ ^ SS,iri*?.StS^£S^e^'£ J^J^ 

£rmri^^r:L^s.^ss:hn^r^ ^ts^^.i^^^v^'^^'''^ 

appreciated that the ^aintemmce of pref^ed data "Uy^°«cutive, a plurahty of asynchronous dok VO 
. ^ jTT t . . H'wifcUBu ibUB requests are issued so as to separately obtain the De- 
r,^^„?'.ffi^'^' nnprovements Read-Ahead dKS[») 

^3r5^Ltstobeprefetched,thatis.the '""f"*°?'1?'r°?*^n''f '^i^n'Tt 
r * 1. 1 /tT Z . F^^ciA^iicu, uiai. a, uic ^^^^ ^ dynamic adjustment of the Read-ahead Prefetch 

^etch lengtti (L , may be assigned statjaiUy or may Leng^ predetermin^pJeT^d W 

be dymmncaUy adjusted m some &shion based on disk ^ implemented. A Read-ahead di^kcache section 

^S^^' conventional re^ ^ defined Lt contains data block addresses obtained 

^iw^ifc^L^fr*' 35 fro"^ ^ ^ost lecent Read-ahead process to each file. 

^ ^ J^^^T^^^^!^ This defined disk cache section is referred to a Most 

cally consecutive to the demanded data blocks. Smce a Recent Read-ahead Section (MRRS). During the data 

^^^U^T^^Tl''^'^''^^^^^ P^^^S °f ^ ^ sul^equent DemaiS hits the 

t^ty on the didc, Le the data may not r«ade m consec mRRS, the Read-ahead LengA (L) is increased by one 

utwe data blocks, the performance advantage of the 40 for a next access to that file. For a Demand that results 

i«id-ah«id didc cache is r^uced Furthermore, in a disk cache miss, the value of L for that file is re- 

Imown types ofconventional disk cache designs, a read. duced by one. No change is made to L when a Demand 

ahead IS issued only when the demanded data is not file request hits a disk cache location other than the 

located witoi the disk cache, Le., during disk cache mRRS. Presently preferred upper and lower limits for 

misses. No file access process is issued for a disk cache 45 l are four (Lmax) and one (Liifflv). respectively- For 

^^'T'^u^. *u 1 • , . . . , , Li4^, up to four data unit (pages) on the disk are 
FIG. 2a lUustrates the logical organization of a plural- checked for file location for a Read^Oiead Prdetch, and 
1^ of sequential data blocks (B) on a mass storage de- only those pages that are physically consecutive or 
vice, typicaUy a disk. By example, each data block may contiguous are accessed in one Read-ahead I/O Pro- 
repr^t a sector or a track of the disk. The group of 50 cess, with the non-contiguous page(s) being accessed in 
data blocks are associated with a file. By example only, a separate Read-ahead I/O Process, 
the file may contain employee records for an organiza- The teaching of the invention also provides for the 
toon. In response to a Demand for a Block (fid, the issuanceof a Read-ahead I/O Process when a Demand 
Block B, is Fetched, and the blocks B/+i to B/+3 are file request hits the MRRS for the first time, and thus 
Prefetched, FIG. 26.illustrates a physical, as opposed to 55 anticipates a sequential data access to the file. This ap- 
a logical, organization of the. data Blocks. As can be proach provides an initiative, active, and constant 
seen, the Blocks associated with the file, may not be Read-ahead for sequentiaUy accessed files. However, if 
physically contiguous to one another, but may instead the Demand request to a file arrives before the comple- 
be sq>arated by some number of Blocks (designated tion of the last Read-ahead I/O process, no additional 
with an X) that are associated with another file, or that 60 Read-ahead I/O process is issued. As a result. Prefetch 
are defective and, hence, unused. The conventional disk operations are not enqueued and there is, at most, one 
cache mechanism performs one I/O Process to access Read-ahead I/O Process at any given time, 
the Demand Block B, and also the Prefetehed Blocks The teaching of the invention thus provides effi- 
B/+1 to B/+3. Because of disk latencies and/or other dency improvements for a disk cache. Furtheimore, 
disk access characteristics, a significant amount of time 65 the teaching of the invention provides Prefetch optimi- 
may elapse between the acc«s to Blocks (B/ and B/^. 1) zations for purely sequential disk accesses, 
and the Blocks (B/+2 and B/+3). As a result, the De- In accordance with a first process of the invention 
manded Block (fid may not be returned to the requester there is provided a method for operating a data cache 
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that is coupled to a data store. The method includes the FIG. 5 shows an example of the use of the invention 

steps of (a) receiving a Demand for a unit of data from when performing sequential and non-seqoendal ac- 

a requestor; .(b) determining from one or more, data - cesses to the file of FIG. 4; and 

cache entries if the Demanded unit of data is resident in FIG. 6 is a flowchart that illustrates a method of the 

•a data cache; and (c) if the Demanded unit of data is ^ invention. 

detennined to be resident in the data cache, returning DETAILED DESCRIPTION OF THE 
the Demanded unit of data from the data cache to the INVENTION 
requestor. In accordance with an aspect of the inven- 
tion, the step of determining includes the steps of, (d) ^ illustrates in block diagram form a data pro- 
determining from a predetermined one of the data cache cessing system 1 that includes at least one Central Pro- 
entries if the Demanded unit of data is a unit of data that cessing Unit (CPU) 2, a main memory 3, and a mass 
was most recently prefetched into the data cache from f^^S^ ^^^j^ typically embodied in a disk 4. The CPU 
the data store, and, if so, (e) prefetching up to L further \ "^l, include a hi^ speed CPU cache 5 that is bidirec- 
units of data from the data store to the data cache. The J?^"^^ ^^^P^!? ^ f data and confrol signal 
step of prefetching prefetches up to L further miits of S^f f t^f ^.^^ specifically a d^cache 
data that are logicaUy contiguo^ to a hist prefetched ^ ^ ^ ^ 
unit of data, and includes a step of updating the prede- t^^^i 1 ^ ^^-^ 
termined one of the data cache entiirtobidicate the SlS^S^^Sn^^^^ T^aI ^ "t.^ 
.^endtiesoftheLprefe^h^ ^ ^ySlSgS^^^^ 
w'nSJt^" vamblesoas toopumizethepref^^^ ^ 9, including^the File System 10 as required. 
mg of lo^y contiguous^ File System 10 is coupled, via a suitable <tevice driver 
In accordance with a ftirAer process of the mventon (^ot shown) and signal lines 11, to the disk 4 for issuing 
there is provided a method for operatmg a data cache j/q Process spedfieis to the disk 4, The I/O ProcS 
that IS coupled to a mass storage device. This method 25 spedfieis include location information, such as disk 4 
nicludes the steps of (a) receiving a Demand for a unit track CD* sector (S), and volume (V) information. In 
of data from a requestor; (b) determining from one or response to the I/O Process specifiers the disk 4 reads 
more data cache entries if the Demanded unit of data is information from the specified storage region(s) and 
resident in a data cache; and if the Demanded unit of provides the read-out data to the disk cache 7 over the 
data is detennined not be resident in the data cache, (c) 30 data path 8. 

accessing the mass storage device to retrieve the De- In a presently preferred embodiment of the invention 
manded unit of data and one or more additional, logi- the OS program 9 is an operating system known as 
cally contiguous units of data. If the one or more addi- OS/2 (OS/2 is a Re^stered Trademark of the Interna- 
tional units of data are physically contiguous to one tional Business Machines CorporationX having a File 
another and to the Demanded unit of data, within the 35 System that is modified as described in detail below. It 
mass storage device, the step of accessing is accom- should be realized, however, that the teaching of the 
plished with one I/O Process. However, if the one or invention is not limited for use with ^y one specific 
more additional units of data are not physically contigu- ^yp^ of operating system, or with any one specific type 
ous to one another or to the Demanded unit of data, the of data processing system architecture. Neither is the 
step of accessing is accomplished with two or more I/O ^ teaching of the invention limited for use only with mass 
Processes. A first I/O Process fetches the Demanded storage devices that comprise rotating disks. By exam- 
unit of data and all additional units of data that are P^^ ^^y> teaching of the invention could also be 
physically contiguous , with the Demanded unit of data, employed to advantage in a multi-processor data pro- 
ifany, and a second I/O Process prefetches one or more cessmg system having a magnetic tape data storage 
non-contiguous others of the additional units of data. As system. As such, the ensuing description of the invcn- 
a result, the Demanded unit of data is returned to the ^ ^ ^ ^ applying only to a system of a 
requestor before the second VO Process is completed, ^ depicted in FIG. 1, or in the FIG. 3 to be described 
thus providing an increased efSciency-in the operation „^ - . 

of the data cache. In FIG. 3 the mam memory 3 can be seen to include 

30 the disk cache 7 and the File System 10. The operation 

BRIEF DESCRIPTION OF THE DRAWING of the File System 10 is described in the context of 

The above set forth and other features of the inven- ^ ^^"^^1?^. ^ coy^t^ ^d registers. It 

tion are made more apparent in the ensuing Detailed ^"f^ ^ "^^f however, that the File System 10 is 

Description of the Invention when read in conjunction « E'f^J^r^x^^^ a pro^ that is executed 

with the attached Drawing, wherein: ^^^A^^ ^"""^^^ ^ "'f^ ^} "^^^ 

HG. lisablockdiagramof adataprocessingsystem ""jS r^if'S;^^ ^ unjriemtmt aU or some of. 

thatincludesadiskcaSitisoperaKS ^efimcUons of the FUe System 

^SA^^5^«Hl*^^Tf°5 V .A u TheFileSysteml0includesanaddressregisterl2for 

^ FIGS.2^2^and2cshowdatablocksassoaatedwith ^ storing a Demand address (DADDR) that is received 

TTT^ ^ • VI 1 J- 1. .« 'he CPU 2. The Demand address specifies an 

A ^^'^'l\^^'^l^f^ imtialaddressofapage,unit,orblockofdatathatisto 

detail the disk cache of the mvention and also a file be read from a file. 

systm that operates the disk cache; as en^)loyed herein, a file is considered to be an 
FIG. 4 shows an example of the use of the invention 65 identifiable assemblage of data bloc^ the data blocks 

when performing a pure sequential access to a file, the being typically logically related to one another. Each 

file being illustrated in a logical as opposed to a physical file has an associated file descriptor, the file descriptor 

format; providing a sunomary of information regarding the file. 
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such as the nmnber of bytes in the file, the type of infor- At Block F a determination is made if the MRRS 
mation contained within the ffle (test, graphics, etc), HIT is the first HIT to the MRRS la. If NO, indicating 
file security information, and so forth. In a presently that the MRRS 7a has been HIT at least once before by 
preferred embodhnent of the invention the I^e System a previous DADDR, control passes to Block 0 to 
10 includes tables and the like for maintaining the file S EXIT. That is, no prefetching occurs. As a result, only 
descriptor information m a manner known in the art In one Prefetch I/O operatkm occurs for a given MRRS 
odier embodiments of die invention, such as a distrib- ^<^ entry. This prevents a potentially large amount of 
nted file system, the file descriptor information maybe disk data being prefetched In response to multiple De- 
maintained and processed differently. maads for one or more of the data blocks indicated by 

A comparator (OOMP) 14 has a first input coupled to 10 the MRRS 7a. It is noted, however, that multiple HTIs 
an output of the address register 12 and a second input ^ ^ MRRS 7a will result in the Length Counter 16 
coupled to the disk cache (DCACHE) 7. More spedfi- ^)&b% inoremmted multiple times, possibly up to the 
cally, the DCACHE 7 includes a plurality of n»axininm allowed value ^Af^. Thus, the value of (L) 
ENTRIES 7ft each of which stores an address or ad- pan be seen to reflect the degree of sequentiality of the 
dresses associated with one or more blocks of data that incoming Demands. A large value of L indicates a high 
are currently resident within the DCACHE 7. In accor- degree of file data block access sequentiality and, as a 
dance with an aspect of the invention, one of these result, it is advantageous to Prefetch larger numbers of 
ENTRIES is designated as a Most Recently Read- logically contiguous data blocks so as to improve the 
Ahead Section (MRRS) 7a. The comparator 14 com- efficiency of the disk cache 7. 
pares the address(es) associated with the MRRS 7a, and ^ ^ ^ 'Bi<x^ F, mdicating that the present 
also the other ENTRIES lb, to the content of the ad- DADDR is the first hit of the MRRS 7a, control passes 
dress register 12. If the content of the address register 12 *° VSock O to prefetch as many as L next units or blocks 
b found to be stored within one of the ENTRIES lb or *® ®® *° prefetched data blocks 

the MRRS 7a, the conq)aiator 14 issues a DCACHE DCACHE 7. An address of the first one of 

HTT or an MRRS HIT, as appropriate. Otherwise, the ^ "* ^ " *° ^ 

comparator 14 issues a DCACHE MISS indication. L?" r^TZ ^-^J^ (MRRSaKnaifr-J- 1). At 

The HIT and MISS indications are received by a Fde "* "f^^ ^ "fleet the ad- 

System Controfler (CONT) 18 which also receives the ^ ^ "°**» 

output from the address register 12. The CONT 18 is ,„ PtT i^^*'^ ^..^^iS^iSJIS 
coupled to a Prefetch Block Length Counter ^"^^ 'T^^/T ^LSS.^"^™^ 
(LI^GTH) 16 by a plurality of signal Hnes. A first "^.^ ^lodc 0 to. EXTT. 
signal line U) mcrements Ae Len^Counter iH ^ ^P' 

second signal Une (-) decrements tteLength Coun^ '^^} v^?^™ ^ ''^t determination 
16.andaSsignLlline(H)HoldstheLenSSJlSS 35 ElS^ % ^il^S th^p t """^^ 
16 at a current count. It is noted that the iteration of ^^ ? 

the HOLD signal line may be accompSted^Sby ^T^^^'J'X^f^, ^ v^^^ 

not activatn«^wr the (+) ^^^ZZ^^ *f ^ *^ T"* 

An ontomt of die CTStrc \»Jr^)hJ ^MA^L, recently prefetched data block. Control then transfers 
!: VI " to Block J where the Demanded data block is returned 

^^^^^^ ^■^a'^'^"^°" « fron.theDCACHE7totherequestor. AtBlocTHe 
^L?nX^SiSS?Tw "^Z T HOLD signal line is asserted ^as to maintain L at the 

^^J^i^I^P ^"^^^ ««™nt value (L=L). Alternately, neither the (+) or 

SISf H^f?!-^ ^ ' (-) signal Imes ie activated; thereby mainidiing 

St?^/S Ta r -"?*^ °\ ^ ^ Length Counter 16 at the c3 

^J^i^^D^^^T^AC^r im' to Block 0 to EXIT. As a 

rtl^* DCACHE or HIT mult, for the case where the Demanded data is resident 

^^^^in^l^^'^ unmediately to ^ the DCACHE 7, but was not the most recently pre- 

^e requestor CPU 2), without requuing a ^ access. fetched, it is mdic;ted that some se^oS ^ 
^Ar^TrS?^^!,^^ databk)ckaccessesisstffloccuning1Sd.a8a^S^ 
TCACHE 7 G^SS asserted), flie CONT 18 untB^ 50 current nmnber (L) of prefetched daS bloctob not 
I/O Process to retneve the Demand data block. The dianged " " 

CONT 18 also operate^w is described below in rela- if the determination at Bkick I is NO, indicatmg a 
tion to the flowchart of FIG. f to possibly prefetch one DCACHE 7 MISS, a determination is made at Block L 
" ^"t^ « tu^- a ma^nm number of if the value of the Length Counter 16 equals a predeter- 

prefetehed blods bemg specified by the content of the 53 mined minimum value (Ljcn). If NO, L is decremented 
LMgUi counter 16. by one at Block M, and at Block N the demanded data 

^^^'^^^°^:;^^,'^^0 6.atBl0GkA umt is fetched from the disk 4 and returned to the re- 

questor.IfYES at Block L. the Block Mis not executed 
SSinp^cSLSp^Bt^TJS?* f andcontrolpassesdirectlytoBlockN.Theopemtionof 

r"^??™^!*^ ^ ^ *e correspond- 60 Block N is combined with the operation of Block O to 

mg DCACHE7dataBretumed to the requestor (Block perform one or more VO Ftecesses to also prefetch up 
C). At Block Da determmation IS made if the content to L-next data units. 

(L) of the Length Counter 16 is equal to a predeter- That is, and referring to HG. 2a for the case where L 
mned maxunum count (^uax)- If NO, the content (L) equals 3, If the next L data blocks (B,+i to B/+3) happen 
the Length Counter 16 is incremented by 1 (Block 65 to be physicaUy contiguous to one another and to 
^ and control passes to Blodi F. If YES at Block D, Blockt then one I/O Process is executed to read the 
the Length Counter 16 is not incremented, and control Demanded Block (Bd and also the three prefetched 
passes to Block F. bkxdcs. However, if the blocks are not contiguously 



01/12/2004, EAST Version: 1.4.1 



5,410,653 

7 8 

stored on the disk 4, and in accordance with an aspect of indicated by the MRRS 7a. The Demanded data block 

the invention illustrated m FIG. 2c, the CONT 18 per- is returned from the DCACHE 7 to the requestor ^G. 

forms two I/O Processes, wherein the Demanded block 6, Block C), L is incremented to 3, and, in that this is the 

is filched in conjunction with the physicaUy contiguous first MRRS HIT, a third prefetch (PF3) is performed to 

prefetched blocks Qf any). The Demanded block is 5 retrieve the three file data blocks As, A6 and A?. The 

stored in the DCACHE 7 and is also returned to the previonsMRRSentry for Asand A4ismoved tooneof 

requestor. A second I/O Process is then executed to the ENTRIES 76, and the MRRS 7a is updated to store, 

prefetch the remaining, non-contiguous data blocks Aj, A£, and A?. At this point, the DCACHE 7 stores 

(B/+2 and B/+3). As a result, the requestor is not re- four prefetched data blocks in anticipation of sequential 

quired to wait for the typically longer disk access cycle 10 Demands from the requestor, 

that is required to retrieve the non-contiguous blocks of A fourth Demand (D4) for A4 results m a MRRS 

file data. MISS mdication, but a DCACHE HIT (FIG. 6, Blocks 

The information specifying the physical locations of I and J). As a result, the Demanded data block (DADD- 

the file data blocks is obtained by the CONT 18 from R« A4) is returned to the requestor from the DCACHE 

the associated file descriptor information in a manner 15 7 and die value of L is held at three (FIG. 6, Block K). 

known in the art. A fifth Demand (D5) once more results in a MRRS 

FIG. 4 is an example of the operation of the invention HIT indication (FIG. 6, Block B), in that the block A5 

as shown in FIGS. 3 and 6, and illustrates a case of a was one of the most recent file data blocks to be pre- 

**purely sequential" file data access wherein the re- fetched, as indicated by the MRRS la. The Demanded 

questor Demands a plurality of logically contiguous 20 data block is returned from the DCACHE 7 to the 

data blocks. It should be noted that in FIG. 4, and also requestor (FIG. 6, Block C) and L is incremented to 4. 

in FIG. 5 described below, that the data blocks are In that this is the first MRRS HIT a fourth prefetch 

shown in a logically consecutive or contiguous manner, (PF4) is performed to retrieve file data blocks As 

with each data block having a block address (AiOasso- through An. The previous MRRS entry for As-A7is 

dated therewith. In practice, the data blocks may be 25 moved to one of the ENTRIES 76, and the MRRS 7a is 

physically non-consecutive or non-contiguous, as de- updated to store As-An. At this point, the DCACHE 7 

picted in FIG. 2c, Thus, it should be realized that the stores six prefetched d^ blocks m anticipation of fnr- 

prefetching of one or more file data blocks may require ther sequential Demands from the requestor, 

the issuance of two or more separate I/O Processes, as The next two Demands (D^ and D7) both MISS the 

described above. Also, in FIGS. 4 and 5 hjumvi as- 30 MRRS 7<i but HIT one of the DCACHE ENTRIES 76. 

sumed to be equal to one, Lju^is assumed to be equal As a result, the demanded data blocks are returned to 

to four, and the increment and decrement values for L the requestor from the DCACHE 7 and the value of L 

are each assumed to be one. In other embodiments of is held at four. 

the invention Lmin and Lmax may have other values, An eighth Demand (Dg) once more results in a 

and the increment value and decrement value for L may 35 MRRS HIT indication (FIG. 6, Block B), m that the 

be other than one, may not be predetermined static block Ag was one of the most recent file data blocks to 

values, and need not be equal to one another. be prefetched, as indicated by the MRRS 7a. The De- 

Furthermore, in FIGS. 4 and 5 each data unit or manded data block is returned from the DCACHE 7 to 

blockisconsideredtobeof equal size. By example only, the requestor (FIG. 6, Block Q. The comparison at 

if each block has 1000 bytes, then a first block is desig- 40 Block D of FIG. 6 indicates that l^^'LmXf and L is 

nated Ao (bytes CiOOO-0999), a second logically consecu- thus held (not incremented or decremented). In that this 

five block is designated Ai (bytes 1000-1999), the third is the first MRRS HIT, a fifth prefetch (^FS) is per- 

logicaUy consecutive block is designated A2 (bytes formed to retrieve the next four file data blocks A12 

2000-2999), etc. As can be seen, for block An, may be through A15. The previous MRRS entry for Ag-An is 

derived by truncating the three least significant digits of 45 moved to one of the ENTRIES lb, and the MRRS 7a is 

the byte address range within the block. updated to store A12-A1S. At this point, the DCACHE 

In FIG. 4 a first Demand ^1) having a DADD- 7 once again stores six prefetched data blocks in antici- 

Rs Ai results in a first file data block bemg fetched (F) pation of more sequentM Demands from the requestor, 

and a first Prefetch (PFl) occunmg. FFl employs an As can be seen, the operation of the method of the 

initial value for L, in this case one, to prefetch the file SO invention provides optimum prefetching for sequential 

data block beginmng at A2. After the F and PFl I/O file data block accesses, without prefetching inordinate 

Process(es), the file data blocks associated with the F amounts of disk data. 

and FFl are located in the DCACHE 7, and the MRRS FIG. 5 depicts the operation of the method of the 

la is updated to store the address A2. invention for the reception of logically non-sequential 

A second Demand (D^) results in a MRRS HIT mdi- 55 Demands. The response to Di and D2 is as described 
cation (FIG. 6, Block B), in that the block A2 was the above in reference to FIG. 4. The third Demand (D3) 
most recent file data block to be prefetched. The De- can be seen to request the file data block A^ This results 
manded data block is returned from the DC ACHE 7 to in both a MRRS and a DCACHE MISS (FIG. 6, 
^e requestor (FIG. 6, Block C) and, in that 1, L is Blocks I, L, M, N). At the occurrence of D3, L=2 as 
incremented to 2 (FIG. 6, Blocks E and D). Also, in that 60 described above in reference to FIG. 4. As a result of 
this is the first MRRS HIT, a second prefetch ^F2) is the MISS, L is decremented to 1, the file data block Aa 
performed to retrieve file data blocks A3 and A4 (FIG. is Fetched (F), and the file data block A7 is Prefetched 
6, Blocks F and G). The previous MRRS entry for A2 (PF3). The previous content of the MRRS la is moved 
is moved to one of ENTRIES lb, and the MRRS 7a is to one of the ENTRIES lb, and the MRRS la is up- 
updated to contain A3 and A4 (FIG. 6, Block H). 65 dated to store A7. 

A third Demand (D3) results in a MRRS HIT indica- Demands D4 and D5 both HIT the MRRS, resulting 

tion (FIG. 6, Block B), in that the block A3 was one of in L being mcremented to two and thra to three in 

the most recent file data blocks to be prefetched, as conjunction with Prefetches PF4 and PF5. Demand D6 



01/12/2004, EAST Version: 1.4.1 



5,410,653 



MISSes the MRRS la but HTTs one of the DCACHE 
ENTRIES lb, resultiiig L being held at three. 

At the occurrence of the non-sequential Demand D? 
the MRRS la stores Aio, An and A12. The DADDR 
for Dyis eqnal.to An- As a result, the MRRS la is HIT S 
(FIG; 6, Blocks B, C, D, £, F, G, and H). This results 
in L bebig inCTemented to four and the FF6 occurxing 
to prefetch the four file data blocks An-Ai6. 

If the Demand following D6 is instead the non- 
sequential Demand Ds', the MRRS la is MISSed and 10 
one of the DCACHE ENTRIES is HIT. As a result, 
the file data block A4 is returned to the requestor from 
DCACHE 7, the value of L is held at 3, and the con- 
tents of the MRRS la remains imchanged. 

If a next Demand after D7 (D?*) has a DADDR=: As, 15 
both the MRRS la and the DCACHE ENTRIES lb 
are MISSed. As a result, the Block As is Fetched (F), L 
is decremented by one, and a PF7 may be executed to 
prefetch file data blocks A6, A? and As. 

If it is assumed for this example that Aa, A7 and Ag are 20 
still resident within the DCACHE 7» as a result of the 
execution of D3 and D4, then Block G of FIG. 6 is 
modified to not prefetch the next L blocks. That is, the 
I/O Process (PF7) is not required. Not performing the 
prefetch for already-resident data blocks improves the 25 
overall efficiency of the DCACHE 7. It should be 
noted that even though the prefetch may not occur, the 
MRRS 7a is still updated at Blodc H of FIG. 6 to mdi- 
cate data blocks A^As. 

Furthermore, a condition may arise where, by exam- 30 
pl^ blodcs A6 and As are resident m the di^ cache 7, 
but block A7 is not For this case, and in accordance 
with an aspect of the invention, a fkst I/O Process is 
initiated to Fetch and return the Demanded data block 
As, and then a second I/O Process is initiated to Pre- 35 
fetch the physically and logically non-contiguous block 
A7. That is, although the current value of L is 3, due to 
the MISS resulting from Demand D7', the CONT 18 is 
not required to prefetch three blocks. Thus, the value of 
L represents a tnaTimntn number of data blocks that 40 
may be prefetched, but does not dictate the actual num- 
ber of prefetched blocks for every case. 

In the foregoing examples it has been assumed that 
when a Demand occurs, such as D6 in FIG. 4, that the 
Prefetched data blocks ^F4) are already encached and 45 
that the MRRS has been updated. However, this is not 
always the case. Considering FIG. 4 once more, the 
Demand 5 (Ds) initiates the execution of PF4. In the 
foregoing description, the receipt of Decaused a MRRS 
MISS, but a DCACHE HIT, in that it was assumed that 50 
the MRRS had already been iqidated as a result of the 
execution of PF4. If, however, D^ and possibly also D7, 
occurs before the oomqpletion of PF4, then the MRRS 
still contains the information firom PF3, Le., As, A^and 
Ay: This condition results in the test at Block F of FIG. 55 
6 i n d ica t in g NO, and a transfer of control to Block 0 to 
EXrr. As a result, for other than the first HIT of the 
MRRS, another Prefetch operation is not executed or 
scheduled for execution. As was noted above, the value 
of L is still incremented regardless of the subsequent 60 
outcome of the test at FIG. 6, Block F indicatmg a first 
or a subsequent HIT of the MRRS. 

It can be appreciated that at some point one or more 
DCACHE ENTRIES may need to be removed to make 
room for new file data. This may be accomplished with 65 
a well known Least Recently Used (LRU) technique. A 
presently preferred method for mgnflgifig the replace- 
ment operation of the DCACHE 7 is described in com- 
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monly assigned U.S. patent application Ser. No. 
07/688,732, filed Apr. 19, 1991, entitled "Ifit-Density- 
Based Replacement for Data Cadie With Pr^etching**. 

While the invention has been particularly shown and 
described with respect to a preferred embodiment 
thereof, it will be understood by those skilled m the art 
that changes in form and details may be made therein 
without departing firom the scope and spirit of the in- 
vention. 

Having thus described our invention, what we claim 
as new, and desire to secure by Letters Patent is: 

1. A method for operating a data cache that is cou- 
pled to a data store, comprismg the steps of: 

recdvmg, from a requestor, a Demand for a unit of 
data, the unit of data being a constituent unit of 
data of a file comprised of a plurality of logically 
sequential units of data; 

determining from one or more data cache entries if 
the Demanded unit of data is resident in said data 
cadxe; and 

if the Demanded unit of data is determined to be 
resident in the data cache, returning the Demanded 
unit of data from the data cache to the requestor, 
wherem the step of determining includes the steps 
of, 

determiiiing from a predetermined one of the data 
cache entries if the Demanded unit of data is a unit 
of data that was most recently prefetched into the 
data cache from the data store, and, if so, 

prefetching up to L further units of data firom the data 
store to the data cache, wherein L represents a 
content of a Prefetch Block Length Counter; 
wherein, 

if the step of determining from one or more data 
cache entries if the Demanded unit of data is resi- 
dent in the data cache indicates that die Demanded 
unit of data is not resident in the data cache, the 
method executes instead the steps of: 

decreasing the value of L by a decrease increment; 
and 

fetching the Demanded unit of data into the data 
cache while also prefetching up to L further units 
of data from the data store to the data cache. 

2. A method as set forth in claim 1 wherein the step of 
prefetching prefetches up to L further units of data that 
are logically contiguous to the Demanded unit of data. 

3. A method as set forth in claim 2 wherein the step of 
prefetching includes a step of updating a predetermined 
one of the data cache entries to indicate the identity of 
the prefetched units of data as being the most recently 
prefetched units of data. 

4. A method as set forth in claim 1 wherem* if the 
prefetched further units of data are physically contigu- 
ous to one another and to the Demanded unit of data, 
within the data store, the step of fetching and prefetch- 
ing is accomplished with one I/O Process. 

5. A method as set forth in claim 1 wherein, if the 
prefetched further units of data are not physically con- 
tiguous to one another or to the Demanded unit of data, 
within the data store, the step of fetching and prefetch- 
mg is accomplished with two or more I/O Processes, 
wherein a first I/O Process fetches the Demanded unit 
of data and all of the up to L units of data that are 
physically contiguous with the Demanded unit of data, 
if any, and wherem a second I/O Process prefetches 
one or more non-physically oontiguons others of the up 
to L units of data. 
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6. A method for opeiatiqg a data cache that is cou* 
pled to a data store, comprising the steps of: 

receiving, from a requestor, a Demand for a unit of 
data; 

determining from one or more data cache entries if 5 
the Demanded unit of data is resident in said data 
cach^ and 

if the Demanded unit of data is determmed to be 
resident in the data cache, returning the Demanded 
unit of data from the data cache to the requestor, 10 
wherein the step of determinmg includes the steps 
of. 

determining from a predetermined one of the data 
cache entries if the Demanded unit of data is a unit 
of data that was most recently prefetched into the IS 
data cache from the data store, and, if so, 
increasing a value L of a prefetch unit counter, 
determining if a previously Demanded unit of data 
was also a unit of data that was most recently pre- 
fetched into the data cache from the data store; 20 
and, if not, 

prefetching up to L units of data from the data store 
to the data cache, the up to L prefetched units of 
data being logically contiguous to a last prefetched 
unit of data; and 25 

updating the predetermined one of the data cache 
entries to mdicate the identities of the prefetched 
units of data; otherwise 

if the step of determining from one or more data 
cache entries if the Demanded unit of data is red- 30 
dent in a data cache indicates that the Demanded 
unit of data is not resident in the data cache, the 
m^od executes instead the steps of: 

decreasing the value of L; 

fetching the Demanded imit of data into the data 35 
cache while also prefetching up to L units of data 
from the data store to the data cache, the up to L 
units of data being logically contiguous to the De- 
manded unit of data; and 

updating the predetermined one of the data cache 40 
entries to indicate the identities of the prefetched 
units of data. 

7. A method as set forth in claim 1 wherein, if the step 
of determining from a predetermined one of the data 
cache entries ^ the Demanded unit of data is a unit of 45 
data that was most recendy prefetched into the data 
cache from the data store indicates no, the value of L is 
held at a current value. 

8. A method as set .forth in claim 1 wherein, if the iq> 

to L frirther units of data are physically contiguous to 50 
one another and to the Demanded nnit of data, within 
the data store, the step of fetching and prefetching is 
accomplished with one I/O Process. 

9. A method as set forth in claim 1 wherein, if the up 
to L further units of data are not physically contiguous 55 
to one another or to the Demanded unit of data, within 
the data store, the step of fetching and prefetching is 
accomplished with two or more I/O Processes, wherdn 

a first I/O Process fetches the Demanded unit of data 



non-physically contiguous others of the up to L units of 
data. 

10. A digital data processor comprised of a data cache 
that b coupled to a data store, said data processor fur- 
ther comprising: 

means, coupled to said data cache and to said data 
store, for prefetching units of data from the data 
store to the data cache; 

means for receiving a Demand for a unit of data; 

first means for determining from one or more data 
cache entries if the Demanded unit of data is resi- 
dent in said data cache; 

means, responsive to the Demanded unit of data 
being determined to be resident in the data cache, 
for returning the Demanded unit of data from the 
data cache; 

second means for determining from a predetermined 
one of the data cache entries if the Demanded unit 
of data is a unit of data that was most recently 
prefetched into the data cache from the data store, 
and, if so, for increasing a value L of a prefetch unit 
counter means; 

third means for determining if a previously De- 
manded unit of data was also a unit of data that was 
most recently prefetched into the data cache from 
the data store; and, if not, causing said prefetching 
means to prefetch up to L units of data from the 
data store to the data cache; 

means for updating the predetermined one of the data 
cache entries to indicate the identities of the pre- 
fetched units of data; wherein 

the digital data processor further comprises means, 
responsive to the operation of the first deteanming 
means indicating that the Demanded unit of data is 
not resident in the data cache, for decreasing the 
value of L; for fetching the Demanded unit of data 
into the data cache and for causing said prefetching 
means to prefetch up to L units of data from the 
data store to the data cache; and for updating the 
predetermined one of the data cache entries to 
indicate the identities of the prefetched units of 
data; and 

means, responsive to the second determining means 
indicating that the Demanded imit of data is a unit 
of data that is resident in the data cache but is not 
a unit most recently prefetched into the data cache 
from the data store, for maintainmg the value of L 
at a current value. 

11. A digital data processor as set forth in claim 10 
wherein said means for prefetching is responsive to a 
condition wherein further units of data are not physi- 
cally contiguous to one another or to the Demanded 
unit of data, within the data store, for prefetching the L 
further units of data with two or more I/O Processes, 
wherein a first I/O Process fetches at least the De- 
manded unit of data and all of the up to L units of data 
that are physically contiguous with the Demanded unit, 
of data, if any, and wherein a second I/O Process pre- 



and aU of the up to L units of data that are physically 60 fetches one or more non-physically contiguous others 
contiguous with the Demanded unit of data, if any, and of the up to L units of data, 
wherem a second I/O Process prefetches one or more • • • * « 
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